Improving a Fundamental Measure of Lexical Association

نویسندگان

  • Gabriel Recchia
  • Paul Nulty
چکیده

Pointwise mutual information (PMI), a simple measure of lexical association, is part of several algorithms used as models of lexical semantic memory. Typically, it is used as a component of more complex distributional models rather than in isolation. We show that when two simple techniques are applied—(1) down-weighting co-occurrences involving lowfrequency words in order to address PMI’s so-called “frequency bias,” and (2) defining co-occurrences as counts of “events in which instances of word1 and word2 co-occur in a context” rather than “contexts in which word1 and word2 cooccur”—then PMI outperforms default parameterizations of word embedding models in terms of how closely it matches human relatedness judgments. We also identify which downweighting techniques are most helpful. The results suggest that simple measures may be capable of modeling certain phenomena in semantic memory, and that complex models which incorporate PMI might be improved with these modifications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Production of English Lexical Stress by Persian EFL Learners

This study examines the phonetic properties of lexical stress in English produced by Persian speakers learning English as a foreign language. The four most reliable phonetic correlates of English lexical stress, namely fundamental frequency, duration, intensity, and vowel quality were measured across Persian speakers’ production of the stressed and unstressed syllables of five English disyllabi...

متن کامل

The Effect of Lexical Collocational Density on the Iranian EFL Learners’ Reading Comprehension

The present study aims at investigating the effect of different levels of lexical collocational density on EFL learners’ reading comprehension. Eighty sophomore students with different levels of proficiency studying at  Zand Institute of Higher Education in Shiraz, Iran were chosen from among eighty five learners based on their score distribution on a reduced TOEFL test constructed by Education...

متن کامل

Equivalency and Non-equivalency of Lexical Items in English Translations of Nahj al-balagha

Lexical items play a key role in both language in general and translation in particular. Likewise, equivalence is a controversial concept discussed so widely in translation studies. Some theorists deem it to be fundamental in translation theory and define translation in terms of equivalence. The aim of this study is to identify the problems of lexical gaps in two translations of Nahj al-ba...

متن کامل

The Role of Lexical Inferencing and Morphological Instruction On EFL Learners' Reading Comprehension Development

This study investigated whether Lexical Inferencing (L1) and Morphological Instruction (MI) can significantly affect EFL learners’ reading comprehension, furthermore, it also examined their effects on the learners’ vocabulary retention over time. 60 homogeneous EFLlearnerswere randomly assigned to two experimental and a control groups. After the pre-test, participants of the first experimental ...

متن کامل

An investigation and comparison of lexical knowledge of deaf and hearing children

Abstract Objectives: The present study examines the lexical knowledge of deaf children in two age groups of 9-10 and 10-11 years old with two groups of normal hearing children of 9-10 and 10-11 years old. Method: This study is a casual-comparative study. The achievement of 16 deaf children (ages 9-10 and 10-11 years old) and 16 hearing children (ages 9-10 and 10-11 years old) were examined on...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017